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Abstract 

We propose a new framework for cooperative spectrum sensing in cognitive radio networks, that 
is based on a novel class of non-uniform samplers, called the event-triggered samplers, and sequential 
detection. In the proposed scheme, each secondary user computes its local sensing decision statistic based 
on its own channel output; and whenever such decision statistic crosses certain predefined threshold 
values, the secondary user will send one (or several) bit of information to the fusion center. The fusion 
center asynchronously receives the bits from different secondary users and updates the global sensing 
decision statistic to perform a sequential probability ratio test (SPRT), to reach a sensing decision. 
We provide an asymptotic analysis for the above scheme, and under different conditions, we compare 
it against the cooperative sensing scheme that is based on traditional uniform sampling and sequential 
detection. Simulation results show that the proposed scheme, using even 1 bit, can outperform its uniform 
sampling counterpart that uses infinite number of bits under changing target error probabilities, SNR 
values, and number of SUs. 

I. Introduction 

Spectrum sensing is one of the most important functionalities in a cognitive radio system 0]], by which 
the secondary users (SU) decide whether or not the spectrum is being used by the primary users. Various 
spectrum sensing methods have been developed based on exploiting different features of the primary 
user's signal [2]. On the other hand, cooperative sensing, where multiple secondary users monitor the 
spectrum band of interest simultaneously and cooperate to make a sensing decision, is an effective way 
to achieve fast and reliable spectrum sensing If3ll — iTTTl . 

In cooperative sensing, each secondary user collects its own local channel statistic, and sends it to a 
fusion center (FC), which then combines all local statistics received from the secondary users to make a 
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final sensing decision. The decision mechanism at the FC can be either sequential or fixed sample size. 
In other words, the FC can either try to make a decision every time it receives new information or it can 
wait to collect a specific number of samples and then make a final decision using them. It is known that 
sequential methods are much more effective in minimizing the decision delay than their fixed sample 
size counterparts. In particular, the sequential probability ratio test (SPRT) is the dual of the fixed sample 
size Neyman-Pearson test, and it is optimal among all sequential tests in terms of minimizing the average 
sample number (decision delay) for i.i.d. observations (H, O- Sequential approaches to spectrum sensing 
have been proposed in a number of recent works |[T0l - |[T5l . 

The majority of existing works on cooperative and sequential sensing assume that the SUs syn- 
chronously communicate to the FC. This implies the existence of a global clock according to which SUs 
sample their local test statistics using conventional uniform sampling. There are a few works allowing 
for asynchrony among SUs (e.g., ITT31 . lfl4l ). but none of them provides an analytical discussion on 
the optimality or the efficiency of the proposed schemes. In this paper, we develop a new framework 
for cooperative sensing based on a class of non-uniform samplers called the event-triggered samplers, 
in which the sampling times are determined in a dynamic way by the signal to be sampled. Such a 
sampling scheme naturally outputs low -rate information (e.g., 1 bit per sample) without performing 
any quantization, and permits asynchronous communication between the SUs and the FC |fT6l . Both 
features are ideally suited for cooperative sensing in cognitive radio systems since the control channel for 
transmitting local statistics has a low bandwidth and it is difficult to maintain synchrony among the SUs. 
Moreover, we will show that by properly designing the operations at the SUs and FC, the cooperative 
sensing scheme based on event-triggered sampling can outperform the one based on the conventional 
uniform sampling. 

The remainder of the paper is organized as follows. In Section II we describe the cooperative spectrum 
sensing problem, in both centralized and decentralized form and we outline three spectrum detectors. In 
Section III we introduce the decentralized spectrum sensing approach based on event-triggered sampling 
and discuss the operations at both the SUs and the FC. In Section IV we perform a comprehensive 
asymptotic performance analysis on the proposed spectrum sensing method, and the one based on 
conventional uniform sampling. Simulation results are provided in Section V. Finally, Section VI concludes 
the paper. 
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II. Problem Formulation and Background 
A. Spectrum Sensing via SPRT 

Consider a cognitive radio network where there are K secondary users performing spectrum sensing 
and dynamic spectrum access. Let {y^}, t 6 N, be the Nyquist-rate sampled discrete-time signal observed 
by the fcth SU, which processes it and transmits some form of local information to a fusion center. Using 
the information received at the fusion center from the K SUs, we are interested in deciding between two 
hypotheses, Ho and Hi, for the SU signals: i.e., whether the primary user (PU) is present (Hi) or not 
(Ho). Specifically, every time the fusion center receives new information, it performs a test and either 
1) stops accepting more data and decides between the two hypotheses; or 2) postpones its decision until 
a new data sample arrives from the SUs. When the fusion center stops and selects between the two 
hypotheses, the whole process is terminated. 

Note that the decision mechanism utilizes the received data sequentially as they arrive at the fusion 
center. This type of test is called sequential as opposed to the conventional fixed sample size test in which 
one waits until a specific number of samples has been accumulated and then uses them to make the final 
hypothesis selection. Since the pioneering work of Wald |8 ], it has been observed that sequential methods 
require, on average, approximately four times ifTTl Page 109] less samples (for Gaussian signals) to reach 
a decision than their fixed sample size counterparts, for the same level of confidence. Consequently, 
whenever possible, it is always preferable to use sequential over fixed sample size approaches. 

Assuming independence across the signals observed by different SUs, we can cast our problem of 
interest as the following binary hypothesis testing problem 

h : {^...^Wo 1 ; {yi ) ---,yf}~/o 2 ;---;{yf,---,yf}~/(f 

(i) 

Hi : {yl, ... ,yl} ~ f\- {yf, . . . ,y t 2 } ~ jf; . . . ; {yf , ... ,yf } ~ /f , 

where ~ denotes "distributed according to" and /q and are the joint probability density functions 
of the received signal by the kth SU, under Ho and Hi respectively. Since we assume independence 
across different SUs the log-likelihood ratio (LLR) L t of all the signals received up to time t, which is 
a sufficient statistic for our problem, can be split as 

K 

Lt = Y. Lk t (2) 

k=l 

where L\ represents the local LLR of the signal received by the /cth SU, namely 

L^tog ffii'-"'^ , (3) 
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Hence each SU can compute its own LLR based on its corresponding observed signal, and send it to the 
fusion center which collects them and computes the global cumulative LLR Lt using (0. Note that the 
local LLRs can be obtained recursively. That is, at each time t, the new observation y\ gives rise to an 

LLR increment l\, and the local cumulative LLR can then be updated as 

t 

L« = Lt 1 + l* = J2& (4) 

n=l 



where 



and fi(yt\yi, ■ ■ ■ ,Ut-i) denotes the conditional pdf of y\ given the past (local) signal samples under 
hypothesis Hj. Of course when the samples of the received signal in each SU are also i.i.d., that is, we 
have independence across time, then the previous expression simplifies considerably and we can write 
q = log ^(^1) where now represents the pdf of a single sample in the fcth SU under hypothesis Hj. 

As we mentioned, the fusion center collects the local LLRs and at each time instant t is faced with a 
decision, namely to wait for more data to come, or to stop receiving more data and select one of the two 
hypotheses Ho, Hi. In other words the sequential test consists of a pair (T,8f) where T is a stopping 
time that decides when to stop (receiving more data) and Sj- a selection rule that selects one of the two 
hypotheses based on the information available up to the time of stopping T ■ 

Of course the goal is to make a decision as soon as possible which means that we would like to 
minimize the delay T, on average, that is, 

minEo[7~], and/or minEi[7~]. (6) 

T,<5r T,8 t 

At the same time we would also like to assure the satisfactory performance of the decision mechanism 
through suitable constraints on the Type-I and Type-II error probabilities, namely 

P (,5 r = 1) < a and Pi(S T = 0) < /3, (7) 

where Pj(-),Ej[-], i = 0, 1 denote probability and the corresponding expectation under hypothesis i. 
Levels a, (3 G (0, 1) are parameters specified by the designer. 

Actually, minimizing in (© each Ej[T], % = 0, 1 over the pairs (7~, 8f) that satisfy the two constraints 
in (0, defines two separate constrained optimization problems. However, Wald first suggested [8] and 
then proved [9] that the Sequential Probability Ratio Test (SPRT) solves both problems simultaneously. 



fi(yt\yt---,yt-i) 
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SPRT consists of the pair (S, 5s) which is defined as follows 



S = -m£{t>0:L t £(-B,A)}, 5 S 



1, if L s > A 



(8) 



0, if L s < -B. 



In other words, at every time instant t, we compare the running LLR L t with two thresholds — B, A where 

A, B > 0. As long as L t stays within the interval (-B, A) we continue taking more data and update 
L t ; the first time L t exits (— B, A) we stop (accepting more data) and we use the already accumulated 
information to decide between the two hypotheses Ho, Hi. If we call S the time of stopping (which is 
clearly random, since it depends on the received data), then when L$ > A we decide in favor of Hi, 
whereas if Ls < — B in favor of Ho- The two thresholds A, B are selected through simulations so that 
SPRT satisfies the two constraints in d7) with equality. This is always possible provided that a + j3 < 1. 
In the opposite case there is a trivial randomized test that can meet the two constraints without taking 
any samples (delay equal to 0). Note that these simulations to find proper values for A, B are performed 
once offline, i.e. before the scheme starts, for each sensing environment. 

The popularity of SPRT is due to its simplicity, but primarily to its very unique optimality properties. 
Regarding the latter we must say that optimality in the sense of ©, (O is assured only in the case of 
i.i.d. observations. For more complex data models, SPRT is known to be only asymptotically optimum. 

SPRT, when employed in our problem of interest, exhibits two serious practical weaknesses. First the 
SUs need to send their local LLRs to the fusion center at the Nyquist-rate of the signal; and secondly, 
the local LLR is a real number which needs infinite (practically large) number of bits to be represented. 
These two problems imply that a substantial communication overhead between the SUs and the fusion 
center is incurred. In this work, we are interested in decentralized schemes by which we mean that the 
SUs transmit low-rate information to the fusion center. 

B. Decentralized Q-SPRT Scheme 

A straightforward way to achieve low-rate transmission is to let each SU transmit its local cumulative 
LLR at a lower rate, say at time instants T, 2T, ...,mT, where the period T € N; and to quantize 
the local cumulative LLRs using a finite number of quantization levels. Specifically, during time instants 
(m — 1)T + l,...,mT, each SU computes its incremental LLR L k mT — L k of the observations 



y(m-l)T+V 



y k mT , to obtain 



mT 




(m-l)T 



(9) 



t=(m-l)T+l 
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where l\ is the LLR of observation y\, defined in ([5]). It then quantizes A^ T into A^ T using a finite 
number f of quantization levels. Although there are several ways to perform quantization, here we are 
going to analyze the simplest strategy, namely uniform quantization. We will also make the following 
assumption 

max < (f) < oo, (10) 

k,t 

stating that the LLRs of all observations are uniformly bounded by a finite constant <fi across time and 
across SUs. 

From (|9]) and ( fTOl ) we can immediately conclude that |A^ T | < T(f>. For our quantization scheme we 
can now divide the interval (— T</>, T</>) uniformly into f subintervals and assign the mid-value of each 
subinterval as the corresponding quantized value. Specifically we define 

r~(A^ T + T0) 2Tc/> 



r 



(11) 



2Tcj) 

These quantized values are then transmitted to the FC. Of course the SU does not need to send the actual 
value but only its index which can be encoded with log 2 f bits. 

The FC receives the quantized information from all SUs, synchronously, and updates the approximation 
of the global running LLR based on the information received, i.e. 

K 

L m T = £(m-l)T + /J ^mT- (12) 
fc=l 

Mimicking the SPRT introduced above, we can then define the following sequential scheme (S,5g), 
where 

( _ > f 1, if Le > A, 

S = TM; M=mf{m>0:L mT &(-B,A)\i 5* = < (13) 
L J ' [ 0, if < —B. 

Again, the two thresholds A, B are selected to satisfy the two error probability constraints with equality. 
We call this scheme the Quantized-SPRT and denote it as Q-SPRT. 

As we will see in our analysis, the performance of Q-SPRT is directly related to the quantization error 
of each SU. Since we considered the simple uniform quantization, it is clear that 

I \ k \k I . . 

l A mT ~ A mT\ < ~- ( 14 ) 

r 

We next consider three popular spectrum sensing methods and give the corresponding local LLR 
expressions. 
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C. Examples - Spectrum Sensing Methods 

Energy Detector: The energy detector performs spectrum sensing by detecting the primary user's 
signal energy. We assume that when the primary user is present, the received signal at the fc-th SU is 
yk _ x k _|_ w k^ w here x\ is the received primary user signal, and w\ ~ JV c (0,<t^) is the additive white 



where %| denotes a central chi-squared distribution with 2 degrees of freedom; and xiCfc) denotes a 
non-central chi-squared distribution with 2 degrees of freedom and noncentrality parameter 9%. 

Using the pdfs of central and non-central chi-squared distributions, we write the local LLR, l\, of the 
observations as follows 



where Io(x) is the modified Bessel function of the first kind and 0-th order. 

Spectral Shape Detector: A certain class of primary user signals, such as the television broadcasting 
signals, exhibit strong spectral correlation that can be exploited by the spectrum sensing algorithm |[T8l . 
The corresponding hypothesis testing then consists in discriminating between the channel's white Gaussian 
noise, and the correlated primary user signal. The spectral shape of the primary user signal is assumed 
known a priori, which can be approximated by a p-th order auto-regressive (AR) model. Hence the 
hypothesis testing problem can be written as 



where {wf}, {v%} are i.i.d. sequences with ~ A/" c (0, a^) and v% ~ A/" c (0, a%), while a\,. . . ,a p are 
the AR model coefficients. 

Using the Gaussian pdf the likelihoods under Hq and Hi can be easily derived. Then, accordingly the 





(16) 



Ho : Vt = 

Hi: y k t =Y:U*iyt-i + v k t 



(17) 
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local LLR of the sample received at time t at the fc-fh SU can be written as 



fi(yt\yt-v->yt- P ) 



i 



t exp - ± | y\ - Ya =1 a>iy k t -i 




log 



^exp(-4H^) 



1 



,fc|2 



1 V 2 0-2 

-2 % fe -y"«iyti +iog^. 



(18) 



Gaussian Detector: In general, when the primary user is present, the received signal by the A;-th SU 
can be written as y\ = h^st + w^, where ~ A/" c (0, pi) is the fading channel response between the 
primary user and the k-th secondary user; st is the digitally modulated signal of the primary user drawn 
from a certain modulation, with E[|s t | 2 ] = 1; and ~ A/" c (0, cr^) is the additive white Gaussian noise. 
It is shown in |[T9l that under both fast fading and slow fading conditions, spectrum sensing can be 
performed based on the following hypothesis testing between two Gaussian signals: 



In this article, we achieve the low-rate transmission required by the decentralized SPRT by adopting 
event-triggered sampling, that is, a sampling strategy in which the sampling times are dictated by the 
actual signal to be sampled, in a dynamic way and as the signal evolves in time. One could suggest to 
find the optimum possible combination of event-triggered sampling and sequential detection scheme by 
directly solving the double optimization problem defined in Q, f7]) over the triplet sampling, stopping 
time, and decision function. Unfortunately the resulting optimization turns out to be extremely difficult not 
accepting a simple solution. We therefore adopt an indirect path. In particular, we propose a decentralized 
spectrum sensing approach based on a simple form of event-triggered sampling, namely, the uniform 
level-triggered sampling. Then we show that the performance loss incurred by adopting this scheme as 
compared to the centralized optimum SPRT is insignificant. This clearly suggests that solving the more 
challenging optimization problem we mentioned before, produces only minor performance gains. 




(20) 



(19) 



III. Decentralized Spectrum Sensing via Level-triggered Sampling 
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A. Uniform Level-triggered Sampling at each SU 

Using uniform level-triggered sampling, each SU samples its local cumulative LLR process {L^ } at 
a sequence of random times {t^}, which is particular to each SU. In other words we do not assume 
any type of synchronization in sampling and therefore communication. The corresponding sequence of 
samples is \L\ k } with the sequence of sampling times recursively defined as follows 

t^inf{t>tt 1 :L t fc -4_ i 0(-A,A)}, t\ = 0, L k = 0. (21) 

where A is a constant. As we realize from (|2TI ) the sampling times depend on the actual realization of 
the observed LLR process and are therefore, as we pointed out, random. Parameter A can be selected 
to control the average sampling periods E,[^ — ££_]J, i = 0, 1. In principle we would like the two 
average periods to coincide to some prescribed value T. For simplicity we will assume that the LLR 
of each observation is symmetric around its mean under the two hypotheses. This guarantees that the 
two average periods under the two hypotheses are the same. However, we are not going to assume that 
the observations have the same densities across SUs. This of course will make it impossible to assure 
that all SUs will communicate with the FC with the same average period if we use the same A at each 
SU, a property which is practically very desirable. In Section HV-BI we propose a practically meaningful 
method to set this design parameter A in a way that assures a fair comparison of our method with the 
classical decentralized scheme, that is, Q-SPRT. 

What is interesting with this sampling mechanism is that it is not necessary to know the exact sampled 
value but only whether the incremental LLR L\ — L% crossed the upper or the lower threshold. This 
information can be represented by using a single bit. Denote with {b^} the sequence of these bits, where 
b\ = +1 means that the LLR increment crossed the upper boundary while b\ = — 1 the lower. In fact 
we can also define this bit as b\ = sign(A^) where A^ = L^ k — L^ k . 

We can now approximate the local incremental LLR as A^ = b^A, and since L^ k = Y^j=i we 
conclude that we can approximate the local LLR at the sampling times using the following equation 

n n 

L k = E X J = E b J A = 41, + & "A. (22) 

j=l 3=1 

Note that we have exact recovery, i.e., L^ k = L^ k , if the difference L\ — L^ k , at the times of sampling, 
hits exactly one of the two boundaries ±A. This is for example the case when \L\ } is a continuous-time 
process with continuous paths. 

The advantage of the level-triggered approach manifests itself if we desire to communicate the sampled 
information, as is the case of decentralized spectrum sensing. Indeed note that with classical sampling we 
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need to transmit, every T units of time, the real numbers L k mT (or their digitized version with fixed number 
of bits). On the other hand, in the level-triggered case, transmission is performed at the random time 
instants {t k } and at each t k we simply transmit the 1-bit value b\. This property of 1-bit communication 
induces significant savings in bandwidth and transmission power, which is especially valuable for the 
cognitive radio applications, where low-rate and low-power signaling among the secondary users is a key 
issue for maintaining normal operating conditions for the primary users. 

We observe that by using (01), we have L k — L k k = Y^j=t k +1 l j where, we recall, l\ is the 
(conditional) LLR of the observation y k at time t at the A;th SU defined in ©. Hence (|2TT > can be 
rewritten as 

t 

t k =mf{t>t k _ 1 : *;£(-A,A)}. (23) 

The level-triggered sampling procedure at each secondary user is summarized in Algorithm [TJ Until the 
fusion center terminates it, the algorithm produces the bit stream {b k t } based on the local cumulative 
LLR values L k at time instants {t k }, and sends these bits to the fusion center instantaneously as they 
are generated. 



lgorithm 1 The uniform level-triggered sampling procedure at the fc-th SU 
Initialization: t <— 0, n <— 

A <- 

while A G (-A, A) do 
t<-t + l 

Compute l\ [cf. Sec. IlLCl 
A «- A + I* 
end while 

n n + 1 

Send b k = sign(A) to the fusion center at time instant t„ 
Stop if the fusion center instructs so; otherwise go to line 2. 



Remarks: 

• Note that the level-triggered sampling naturally censors unreliable local information gathered at SUs, 
and allows only informative LLRs to be sent to the FC. 

• Note also that each SU essentially performs a local SPRT with thresholds ±A. The stopping times 
of the local SPRT are the inter-sampling intervals and the corresponding decisions are the bits {b k t } 
where b k = +1 and b k = — 1 favor Hi and Hq respectively. 
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B. Proposed Decentralized Scheme 

The bit streams {b„}k from different SUs arrive at the FC asynchronously. Using (O and d22l) . the 
global running LLR at any time t is approximated by 

K K 
fc=l fe=ln:t*<t 

In other words the FC adds all the received bits transmitted by all SUs up to time t and then normalizes the 
result with A. Actually the update of L t is even simpler. If {t n } denotes the sequence of communication 
instants of the FC with any SU, and {b n } the corresponding sequence of received bits then L tn = 
L tn _ 1 + b n A while the global running LLR is kept constant between transmissions. In case the FC 
receives more than one bit simultaneously (possible in discrete time), it processes each bit separately, as 
we described, following any random or fixed ordering of the SUs. 

Every time the global LLR process {L t } is updated at the FC it will be used in an SPRT-like test to 
decide whether to stop or continue (receiving more information from the SUs) and in the case of stopping 
to choose between the two hypotheses. Specifically the corresponding sequential test (S, 6g) is defined, 
similarly to the centralized SPRT and Q-SPRT, as 

r - 1 j 1. if La > A, 

S = ttf] N = M\n>0:L t g(-B,A)\; 6 S = { 5 (25) 

L J ^ ! 0, if L § < -B. 

S counts in physical time units, whereas M in number of messages transmitted from the SUs to the FC. 
Clearly d25l ) is the equivalent of ( fT3T ) in the case of Q-SPRT and expresses the reduction in communication 
rate as compared to the rate by which observations are acquired. In Q-SPRT the reduction is deterministic 
since the SUs communicate once every T unit times, whereas here it is random and dictated by the local 
level triggered sampling mechanism at each SU. The thresholds A, B, as before, are selected so that 
(S,5g) satisfies the two error probability constraints with equality. The operations performed at the FC 
are also summarized in Algorithm^ 

C. Enhancement 

A very important source of performance degradation in our proposed scheme is the difference between 
the exact value of L\ and its approximation L\ (see ifToll ). In fact the more accurately we approximate 
L\ the better the performance of the corresponding SPRT-like scheme is going to be. In what follows 
we discuss an enhancement to the decentralized spectrum sensing method described above at the SU 
and FC, respectively. Specifically, for the SU, we consider using more than one bit to quantize the 



May 17, 2012 



DRAFT 



12 



Algorithm 2 The SPRT-like procedure at the fusion center 

1: Initialization: L <— 0, n <— 

2: while L e (-B, A) do 
3: n <— n + 1 

4: Listen to the SUs and wait to receive the next bit &„ at time t n from some SU 

5: Z, L + 6„A 

6: end while 

7: Stop at time S = t n 

8: if L > A then 

9: 5j = 1 - the primary user is present 

10: else 

11: <jj = - the primary user is not present 

12: end if 

13: Inform all SUs the spectrum sensing result 



local incremental LLR values, while at the FC, we are going to use this extra information in a specific 
reconstruction method that will improve the approximation L\ and, consequently, the approximation of 
the global running LLR. We anticipate that this enhancement will induce a significant improvement in 
the overall performance of the proposed scheme by using only a small number of additional bits. Finally 
we should stress that there is no need for extra bits in the case of continuous-time and continuous-path 
signals since, as we mentioned, in this case L\ and L\ coincide. 

Overshoot Quantization at the SU: Recall that for the continuous-time case, at each sampling instant, 
either the upper or the lower boundary can be hit exactly by the local LLR, and therefore the information 
transmitted to the fusion center was simply a 1-bit sequence and this is sufficient to recover completely the 
sampled LLR using (l22l ). In discrete-time case, at the time of sampling, the LLR is no longer necessarily 
equal to the boundary since, due to the discontinuity of the discrete-time signal, we can overshoot 
the upper boundary or undershoot the lower boundary. The over(under)shoot phenomenon introduces a 
discrepancy in the whole system resulting in an additional information loss (besides the loss in time 
resolution due to sampling). Here we consider the simple idea of allowing the transmission of more 
than one bits, which could help approximate more accurately the local LLR and consequently reduce the 
performance loss due to the over(under)shoot phenomenon. 

Bit b\ informs whether the difference Ajj = L k tk —L^ k overshoots the upper threshold A or undershoots 
the lower threshold —A. Consequently the difference q k n = |A^| — A > 0, corresponds to the absolute 
value of the over(under)shoot. It is exactly this value we intend to further quantize at each SU. Note 
that q k cannot exceed in absolute value the last observed LLR increment, namely \l^ k \. To simplify our 
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analysis we will assume that < <j) < oo for all k,t as in (fTOl) . In other words the LLR of each 
observation is uniformly bounded across time and SUs. 

Since for the amplitude of the over(under)shoot we have < \l^ k | this means that < q* < 0. Let 
us now divide the interval [0, <ft), uniformly, into the following f subintervals [(m— 1), m)t, m = 1, . . . , f. 
Whenever q\ falls into one such subintervals the corresponding SU must transmit a quantized value q\ 
to the FC. Instead of adopting some deterministic strategy and always transmit the same value for each 
subinterval, we propose the following simple randomized quantization rule 

l-cxp(q*-(L4^J+l)f) 

i - exp( n " (26) 



L^J f, with probability p 



+ 1) f , with probability 1 - p = ^^J^ " ' 

Simply said, if q\ E [(m — l),m)i then we quantize q\ either with the lower or the upper end of 
the subinterval by selecting randomly between the two values. The quantized value q\ that needs to be 
transmitted to the FC clearly depends on the outcome of a random game and is not necessarily the same 
every time q\ falls into the same subinterval. Regarding the randomization probability p the reason it 
has the specific value depicted in d26l ) will become apparent in LemmaQ] 

If we have f subintervals then we transmit r + 1 different messages corresponding to the values 
mi, m = 0, . . . ,f. Combining them with the sign bit 6^ that also needs to be communicated to the FC, 
yields a total of 2(r + 1) possible messages requiring log 2 2(1 + f) = 1 + log 2 (l + r) bits for transmitting 
this information. It is clear that each SU needs to transmit only an index value since we assume that the 
FC knows the list of all 2(1 + f) quantized values. 

Modified Update at the FC: Let us now turn to the FC and see how the latter is going to use this 
additional information. Note that the kth SU, every time it samples, transmits the pair (b^,q^) where, 
we recall, the sign bit 6^ informs whether we overshoot the upper threshold A or undershoot the lower 
threshold —A and q^ the quantized version of the absolute value of the over(under)shoot. Consequently 
since we have = 6^(A + it is only natural to approximate the difference as follows 

A* = #(A + g*V (27) 



which leads to the following update of the local running LLR 

4\=% i+ ^(A + ^). (28) 

This should be compared with the simplified version (1221 ) where the term q\ is missing. It is exactly 
this additional term that increases the accuracy of our approximation and contributes to a significant 
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performance improvement in our scheme. Of course the update of the global running LLR is much 
simpler since the FC, if it receives at time t n information (b n , q n ) from some SU, then it will update its 
approximation of the global running LLR as follows 

L tn = L tn _ x + b n (A + q n ) . (29) 

The updated value will be held constant until the next arrival of information from some SU. 

For the SU operations given in Algorithm [T] only line 10 should be modified when multiple bits are 
used at each sampling instant, as follows 

10: Quantize <j£ as in d26l i and send to the fusion center at time £™. 

On the other hand, for the FC operations given in Algorithm [U lines 4 and 5 should be modified as 
follows 

4: Listen to the SUs and wait to receive the next message (b n ,q n ) from some SU. 
5: L <- L + 6„(A + q n ). 

With the proposed modification at each SU and at the FC we have in fact altered the communication 
protocol between the SUs and the FC and also the way the FC approximates the global running LLR. The 
final sequential test (S, 5g) however, is exactly the same as in (l25l ). We are going to call our decentralized 
test Randomized Level Triggered SPRT and denote it as RLT-SPRlQ. As we will demonstrate theoretically 
and also through simulations, the employment of extra bits in the communication between SUs and FC 
will improve, considerably, the performance of our test, practically matching that of the optimum. 

Let us now state a lemma that presents an important property of the proposed quantization scheme. 
Lemma 1. Let q\ be the (f + l)-level quantization scheme defined in (1261) for the overshoot q%, then 

E[e ±&£(A+<S)| 6 fc )(7 *] < e ±6S(A+ g ») = e ±{ - L \~ L \_ x \ (30 ) 

where E[-] denotes expectation with respect to the randomization probabilities. 

Proof: For given q%, q\ takes the two values defined in d26l ) with probability p and \ —p respectively. 
Define e = t, that is, the common length of the subintervals. Suppose that (m — l)e < q!^ < me, m = 
1, . . . , r then q\ takes the two end values with probabilities p, (1 — p) respectively, but let us consider p 
unspecified for the moment. We would like to select p so that 

±6»(A+(m-l)*) + M _ p \ e ±6*(A+me) < ^±5*(A+«*)_ ( 31) 

'in 1 16 1 the corresponding decentralized D-SPRT test that uses level triggered sampling at the sensors (that play the role of 
the SUs) is based only on 1-bit communication. 
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Since b\ is a sign bit this is equivalent to solving the inequality 

pe±(m-i)e + (! _ p ) e ±™ < e ±il j (32 ) 

from which we conclude that 

{e _m ^ — e _<? " e mg — e g ™ 1 
: -, ttt ■ ; -, ttt. > (33) 
g— me g— (m— l)e gme g(m— l)e 

It is straightforward to verify that the second ratio is the smallest of the two, consequently we define p 
to have this value which is the one depicted in (l26l ). ■ 
Note that the approximation in the incremental LLR L\ k — L^ k induces an equivalent approximation 
for the incremental LR exp(L^ fc — L^ k ). The randomization is selected so that the latter, in average 
(over the randomization), does not exceed the exact incremental LR. One could instead select p so that 
the average of the approximation of the incremental LLR matches the exact LLR value. Even though 
this seems as the most sensible selection, unfortunately, it leads to severe analytical complications which 
are impossible to overcome. The proposed definition of p, as we will see in the next section, does not 
have such problems. 

IV. Performance Analysis 

In this section, we provide an asymptotic analysis on the stopping time of the decentralized spectrum 
sensing method based on the level-triggered sampling scheme proposed in Section [Till and compare it 
with the centralized SPRT procedure given by ([8]). A similar comparison is performed for the conventional 
decentralized approach that uses uniform sampling and quantization [cf. (l9l).(fT2l)l. For our comparisons 
we will be concerned with the notion of asymptotic optimality for which we distinguish different levels 

na, m. 

Definition 1. Consider any sequential scheme (T, 8f) with stopping time T and decision function b-j 
satisfying the two error probability constraints Po(5j- = 1) < a and Pi(<5t = 0) < /?. If S denotes the 
optimum SPRT that satisfies the two error probability constraints with equality then, as the Type-I and 
Type-II error probabilities a, j3 — > 0, the sequential scheme (T, Sf) is said to be order-1 asymptotically 
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optimal if 



1 < 



Eiin 



1 + o a ,p(l); 



(34) 



order-2 asymptotically optimal if 



0< E,[T] 



Ei[5] =0(1); 



(35) 



and finally order-3, if 



0< EifTl-Eip] =o Q ,0(l) 



(36) 



where Pj(-) a«fi? E,[-] denote probability and the corresponding expectation under hypothesis Hj, i = 0, 1. 

Remark: In our definitions the left-hand side inequalities are automatically satisfied because 5 is the 
optimum test. Note that order-2 asymptotic optimality implies order- 1 because Ei[<S],Ej[T] -> oo as 
a, /3 — )• 0; the opposite is not necessarily true. Order-1 is the most frequent form of asymptotic optimality 
encountered in the literature but it is also the weakest. This is because it is possible Ej [7~] to diverge from 
the optimum Ej [S] without bound and still have a ratio that tends to 1 . Order-2 optimality clearly limits the 
difference to bounded values, it is therefore stronger than order-1. Finally the best would be the difference 
to become arbitrarily small, as the two error probabilities tend to 0, which is the order-3 asymptotic 
optimality. This latter form of asymptotic optimality is extremely rare in the Sequential Analysis literature 
and corresponds to schemes which, for all practical purposes, are considered as optimum per se. 

Next we study the three sequential tests of interest, namely the optimum centralized SPRT, the Q-SPRT 
and the RLT-SPRT and compare the last two with the optimum in order to draw conclusions about their 
asymptotic optimality. We start by recalling from the literature the basic results concerning the tests of 
interest in continuous time. Then we continue with a detailed presentation of the discrete-time case where 
we analyze the performance of Q-SPRT and RLT-SPRT when the corresponding quantization schemes 
have a number of quantization levels that depends on the error probabilities. 

A. Analysis of Centralized SPRT, Q-SPRT and RLT-SPRT 

With continuous-time and continuous-path observations at the SUs, it is known that RLT-SPRT, using 
only 1-bit achieves order-2 asymptotic optimality [16], whereas Q-SPRT cannot enjoy any type of 

2 A quick reminder for the definitions of the notations o(-), O(-) and €>(-) : f( x ) = ° (g{ x )) if f( x ) grows with a lower rate 
than g(x); f(x) = O (g(x)) if f(x) grows with a rate that is no larger than the rate of g(x)\ and f(x) — O (g(x)) if f(x) 
grows with exactly the same rate as g(x). Thus o(l) represents a term that tends to 0. Particularly for this case we will write 
o x (l) to indicate a quantity that becomes negligible with x and o x , y (l) to indicate a quantity that becomes negligible either 
with x or with y or with both. 
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optimality by using fixed number of bits (22. 

In discrete time the corresponding analysis of the three sequential schemes of interest becomes more 
involved, basically due to the over(under)shoot effect. This is particularly apparent in RLT-SPRT where 
because of the over(under)shoots, 1-bit communication is no longer capable of assuring order-2 asymp- 
totic optimality as in the continuous-time and continuous-path case. In order to recover this important 
characteristic in discrete time, we are going to use the enhanced quantization/communication scheme 
proposed in Section llll-CI Let us now consider in detail each test of interest separately. 

In discrete time, for the optimum centralized SPRT, we have the following lemma that provides the 
necessary information for the performance of the test. 

Lemma 2. Assuming that the two error probabilities a, (3 —> at the same rate, the centralized SPRT, 
S, satisfies 

Eo[5] > -^i{a,p) = +op(l); E 1 [S] > j^Hfaa) = 1^1 + ^(1), (37) 

where %(x,y) = x\ogj^ + (1 — x) log and lj = -^|Ej[Li]|, i = 0,1 are the average Kullback- 
Leibler information numbers of the process {L t } under the two hypotheses. 

Proof: It should be noted that these inequalities become equalities in the continuous-time continuous- 
path case. The proof can be found in |[22l Page 21]. ■ 
Let us now turn our attention to the two decentralized schemes, namely the classical Q-SPRT and the 
proposed RLT-SPRT. We have the following theorem that captures the performance of Q-SPRT. 
Theorem 1. Assuming that the two error probabilities a, (3 — > at the same rate, and that the number 
f of quantization levels increases with a, (3, then the performance of Q-SPRT, S, as compared to the 
optimum centralized SPRT, S, satisfies 

< Ex [S] - E 1 [S\ < + o r -(l)} + tU\ + 0f (l)} + o a (l) 

Kk r li 

911 flU ch (38) 

< E [5] - E [5] < + of (1)} + rf {1 + o r -(l)} + 0p (l). 

Proof: The proof can be found in Appendix A. ■ 
As with the classical scheme, let us now examine the behavior of the proposed test when the number of 
quantization levels increases as a function of the two error probabilities a, (3. We have the next theorem 
that summarizes the behavior of RLT-SPRT. 

Theorem 2. Assuming that the two error probabilities a, f3 — > at the same rate, and that the number 
f of quantization levels increases with a, f3, then the performance of RLT-SPRT, S, as compared to the 
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optimum centralized SPRT, S, satisfies 

< E 1 [S] - Ei[S] < - ' lQ + ga ' + OA,f (1)} + f (A + 0) + o A) *(l) + oo(l), 

i-CliAtanh(4?) max{r, 1} li 

(39) 

< E [5] - E [S] < ' log/? ' ^— {1 + o A , f (1)} + f (A + <f>) + OA , f (1) + 0/3 (l). 

i^l Atanh(^j maxjr, 1} l 

Proof: The proof is presented in Appendix B. ■ 
B. Comparisons 

In order to make fair comparisons, the two decentralized schemes need to satisfy the same commu- 
nication constraints. First, each SU is allowed to use at most s bits per communication. This means 
that the number of quantization levels r in Q-SPRT must satisfy f = 2 s while for RLT-SPRT we have 
2(1 + r) = 2 s suggesting that r = 2 S ~ 1 — 1. 

The second parameter that needs to be specified is the information flow from the SUs to the FC. Since 
receiving more messages per unit time increases the capability of the FC to make a faster decision, it 
makes sense to use the average rate of received messages by the FC as a measure of the information 
flow. In Q-SPRT, every T units of time the FC receives, synchronously, K messages (from all SUs), 
therefore the average message rate is Computing the corresponding quantity for RLT-SPRT is less 
straightforward. Consider the time interval [0, t] and denote with Mt the total number of messages received 
by the FC until t. We clearly have Mt = Y2k=i where M^ is the number of messages sent by the 
fcth SU. We are interested in computing the following limit 

lim ^ = lim Y = Y lim - k = V-^-, (40) 

k=l k=l JIf\l^n=l l n l n-l) + Jjf\ l k=l U lJ 

where we recall that {t^} is the sequence of sampling times at the fcth SU and for the last equality 
we used the Law of Large Numbers since when t — > oo we also have Mf — > oo. Consequently we 
need to select A so that Ylk=i E-[ffi = T- ^° °htain a convenient formula we are going to become 
slightly unfair for RLT-SPRT. From d65l ) in Lemma[7]we have that ^-j^ < Jtanh(^) ' which means that, 
Ef=i eTM ^ Atanh(l-) - Therefore, if we set ^^Mf) = T or ' e q ui v al ently, Atanh(A) = T\ u the 
average message rate of RLT-SPRT becomes slightly smaller than the corresponding of Q-SPRT. Note 
that the average Kullback-Leibler information numbers, = 0,1, can be once computed offline via 
simulations. 

Under the previous parameter specifications, we have the following final form for the performance of 
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the two schemes. For Q-SPRT 



< E 1 [S] - E 1 [S] < ^^^{1 + o.(l)} + T^{1 + .(1)} + 0a (l) 
< E [5] - E [<S] < ^2 ^^{1 + o s (l)} + T^{1 + ,(1)} + o p {l) 



(41) 



(42) 



while for RLT-SPRT 

< 60 - E l[S ] < ^ --^gL d + 0T „(1) } + r + i + ot,.(1) + «.(!), 

< E„[5] - E [5] < ^ ^L, H + <*,(!)> + T + * + OT „(1) + „«(!). 

Comparing (|4TT) with (l42l there is a definite resemblance between the two cases. However in RLT- 
SPRT we observe the factor A in the first term of the right hand side which, as we will immediately 
see, produces significant performance gains. Since T is the communication period, and we are in discrete 
time, we have T > 1. Actually, for the practical problem of interest we have T> 1 suggesting that the 
first term in RLT-SPRT is smaller by a factor T, which can be large. 

For fixed T and s, none of the two schemes is asymptotically optimum even of order- 1. However, in 
RLT-SPRT we can have order- 1 asymptotic optimality when we fix the number of bits s and impose 
large communication periods. Indeed using 071 ) of Lemma[2] we obtain 

Ei [51 Ei[«Sl - Ex [S] /1\ TKU 
1 <ET^T = 1 + rci — <1 + Q - +- Uo a (l), (43) 



Ei [5] Ei [5] " \TJ | log a 

consequently selecting T — > oo but -> we assure order- 1 optimality. It is easy to verify that 

the best speed of convergence towards 1 of the previous right hand side expression is achieved when 



T = e(^bg^). 

We should emphasize that similar order- 1 optimality result, just by controlling the period T, cannot be 
obtained in Q-SPRT, and this is due to the missing factor A in (|4TT) . Consequently this is an additional 
indication (besides the continuous-time case) that the proposed scheme is more efficient than the classical 
decentralized Q-SPRT. 

Let us now examine how the asymptotic optimality properties of the two methods improve when we 
allow the number of bits s to grow with a, (3, while keeping T constant. Note that in the case of Q-SPRT 
selecting 2 S_1 = |loga| or, equivalently, s = 1 + log 2 |loga| assures order-2 asymptotic optimality. 
For RLT-SPRT, using for simplicity the approximation 2 S_1 — 1 « 2 S_1 , the same computation yields 
s = 1 + log 2 | log a I — log 2 T. Of course the two expressions are of the same order of magnitude, however 
in RLT-SPRT the additional term — log 2 T, for all practical purposes, can be quite important resulting in 
a need of significantly less bits than Q-SPRT to assure order-2 asymptotic optimality. The conclusions 
obtained through our analysis, as we will see in the next section, are also corroborated by our simulations. 
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V. Simulation Results 

In this section, we provide simulation results to evaluate the performance of the proposed cooperative 
spectrum sensing technique based on level-triggered sampling and that based on conventional uniform 
sampling, and how the two tests compare with the optimum centralized scheme. In the simulations, the 
sampling period of the uniform sampling is set as T = 4. For the level triggered sampling, we adjust 
the local threshold A so that the average rate of received messages by the FC matches that of uniform 
sampling, i.e. Y^k=\ e~TP] = T ( see Section IIV-Bb . The upper-bound <f> for overshoot values is set as the 
10" 4 quantile of the LLR of a single observation which is computed once offline via simulations. We 
mainly consider a cognitive radio system with two SUs, i.e., K = 2, but the effect of increasing user 
diversity is also analyzed in the last subsection. All results are obtained by averaging 10 4 trials and using 
importance sampling to compute probabilities of rare events. We primarily focus on the energy detector 
since it is the most widely used spectrum sensing method. The results for the spectral shape detector and 
the Gaussian detector are quite similar. In the subsequent figures average sensing delay performances are 
plotted under Hi. 

Fixed SNR and K, varying a, /?: We first verify the theoretical findings presented in Section [TV] on 
the asymptotic optimality properties of the decentralized schemes. We assume two SUs operate in the 
system, i.e. K = 2. For the energy detector, we set the receiver SNR for each SU to 5 dB and vary the 
error probabilities a and j3 together between 10 _1 and 10~ 10 . 

Fig. Q] illustrates asymptotic performances of the decentralized schemes using 1,2, 3 and oo number 
of bits. Our first interesting result is the fact that by using a finite number of bits we can only achieve a 
discrete number of error rates. Specifically, if a finite number of bits is used to represent local incremental 
LLR packages, then there is a finite number of possible values to update the global running LLR (e.g. for 
one bit we have ±A). Hence, the global running LLR, which is our global test statistic, can assume only a 
discrete number of possible values. This suggests that any threshold between two consecutive LLR values 
will produce the same error probability. Consequently, only a discrete set of error probabilities (a, /3) are 
achievable. Increasing the number of bits clearly increases the number of available error probabilities. 
With infinite number of bits any error probability can be achieved. The case of infinite number of bits 
corresponds to the best achievable performance for Q-SPRT and RLT-SPRT. Having their performance 
curves parallel to that of the optimum centralized scheme, the oo-bit case for both Q-SPRT and RLT- 
SPRT exhibits order-2 asymptotic optimality. Recall that both schemes can enjoy order-2 optimality if 
the number of bits tends to infinity with a rate of log | loga|. 
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(c) 



Fig. 1. Average detection delay vs error probabilities (a,/3) for optimum centralized and Q-SPRT, RLT-SPRT with l,2,3,oo 
number of bits. 



It is notable that the performance of RLT-SPRT with a small number of bits is very close to that of 
oo-bit RLT-SPRT at achievable error rates. For instance, the performance of 1-bit case coincides with 
that of oo-bit case, but only at a discrete set of points as can be seen in Fig.[T|-b. However, we do not 
observe this feature for Q-SPRT. Q-SPRT with a small number of bits (especially one bit) performs 
significantly worse than oo-bit case Q-SPRT as well as its RLT-SPRT counterpart. In order to achieve a 
target error probability that is not in the achievable set of a specific finite bit case, one should use the 
thresholds corresponding to the closest smaller error probability. This incurs a delay penalty in addition 
to the delay of the oo-bit case for the target error probability, demonstrating the advantage of using more 
bits. Moreover, it is a striking result that 1-bit RLT-SPRT is superior to oo-bit Q-SPRT at its achievable 
error rates, which can be seen in Fig.[I|-c. 

Fig. |2] corroborates the theoretical result related to order- 1 asymptotic optimality that is obtained in 
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2 4 6 8 10 12 14 16 18 20 



]loga| = |log/3| 

Fig. 2. Average detection delay normalized by the optimum centralized performance vs. error probabilities (a, /3) for Q-SPRT 
and RLT-SPRT with 2 bits and communication period either T — 4 or T = 0(y| log a|). 

(|43T >. Using a fixed a number of bits, s = 2, the performance of RLT-SPRT improves and achieves 
order- 1 asymptotic optimality, i.e. = 1 + o(l), as the communication period tends to infinity, T = 
9(y| log a|). Conversely, the performance of Q-SPRT deteriorates under the same conditions. Although 
in both cases Q-SPRT converges to the same performance level, its convergence speed is significantly 
smaller in the increasing T case, which can be obtained theoretically by applying the derivation in (l43l) 
to (|4TT ). This important advantage of RLT-SPRT over Q-SPRT is due to the fact that the quantization 
error (overshoot error) observed by SUs at each communication time in RLT-SPRT depends only on the 
LLR of a single observation, but not on the communication period, whereas that in Q-SPRT increases 
with increasing communication period. Consequently, quantization error accumulated at the FC becomes 
smaller in RLT-SPRT, but larger in Q-SPRT when T = 0(yjTog~a[) compared to the fixed T case. 
Note in Fig.[2] that, as noted before, only a discrete number of error rates are achievable since two bits 
are used. Here, we preferred to linearly combine the achievable points to emphasize the changes in the 
asymptotic performances of RLT-SPRT and Q-SPRT although the true performance curves of the 2-bit 
case should be stepwise as expressed in Fig.Q] 

Fixed a, (3 and K, varying SNR: Next, we consider the sensing delay performances of Q-SPRT and 
RLT-SPRT under different SNR conditions with fixed a = (3 = 1CT 6 and K = 2. In Fig.[3j it is clearly 
seen that at low SNR values there is a huge difference between Q-SPRT and RLT-SPRT when we use 
one bit, which is the most important case in practice. This remarkable difference stems from the fact that 
the one bit RLT-SPRT transmits the most part of the sampled LLR information (except the overshoot), 
whereas Q-SPRT fails to transmit sufficient information by quantizing the LLR information. Moreover, 
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as we can see the performance of the 1-bit RLT-SPRT is very close to that of the infinite bit case and the 
optimum centralized scheme. At high SNR values depicted in Fig.[3]-b, schemes all behave similarly, but 
again RLT-SPRT is superior to Q-SPRT This is because the sensing delay of Q-SPRT cannot go below 
the sampling interval T = 4, whereas RLT-SPRT is not bounded by this limit due to the asynchronous 
communication it implements. 





SNR(dB) 
(a) 



SNR(dB) 

(b) 



Fig. 3. Average detection delay vs SNR for optimum centralized and Q-SPRT, RLT-SPRT with l,oo number of bits. 

Fixed SNR, a and (3, varying K: We, then, analyze the case where the user diversity increases. In 
Fig.|4l it is seen that with increasing number of SUs, the average sensing delays of all schemes decay 
with the same rate of 1/K as shown in Section [W] (cf. (l37l ). (|38T ) and (l39ll). The decay is more notable 




Number of SUs K 

Fig. 4. Average detection delay vs number of SUs (K) for optimum centralized and Q-SPRT, RLT-SPRT with l,oo number 
of bits. 

for the 1-bit case because the overshoot accumulation is more intense, but very quickly becomes less 



May 17, 2012 



DRAFT 



24 



pronounced as we increase the number of SUs. It is again interesting to see that the 1-bit RLT-SPRT is 
superior to the oo-bit Q-SPRT for values of K greater than 3. 

Fixed SNR, K and Ej[7~], varying P F and P M : Finally, we show the operating characteristics of the 
schemes for fixed SNR= —3 dB, K = 2 and Ej[T]. False alarm (P F ) and misdetection (Pm) proba- 
bilities of the schemes are given in Fig.[5] when they have exactly the same three average delay pairs, 
(Eo[7~], Ei[7~]). Note that no target error probabilities,^ and /3, are specified in this set of simulations. 
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Fig. 5. False alarm probability (Pf) vs. misdetection probability (Pm) for optimum centralized and Q-SPRT, RLT-SPRT with 
l,oo number of bits. 



Thresholds, A and B, are dictated by the given average delays. We again observe here that 1-bit Q-SPRT 
performs considerably worse than 1-bit RLT-SPRT and the other schemes. Its error probability pairs are 
far away from the group of others. On the other hand, error probability pairs of 1-bit RLT-SPRT are 
close to those of the oo-bit schemes and the optimum centralized scheme. 

VI. Conclusion 

We have proposed and rigorously analyzed a new spectrum sensing scheme for cognitive radio net- 
works. The proposed scheme is based on level-triggered sampling which is a non-uniform sampling 
technique that naturally outputs 1 bit information without performing any quantization, and allows SUs 
to communicate to the FC asynchronously. Therefore, it is truly decentralized, and it ideally suits the 
cooperative spectrum sensing in cognitive radio networks. With continuous-time observations at the SUs 
our scheme achieves order-2 asymptotic optimality by using only 1 bit. However, its conventional uniform 
sampling counterpart Q-SPRT cannot achieve any type of optimality by using any fixed number of bits. 
With discrete-time observations at the SUs our scheme achieves order-2 asymptotic optimality by means 
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of an additional randomized quantization step (RLT-SPRT) when the number of bits tends to infinity at 
a considerable slow rate, 0(log | loga|). In particular, RLT-SPRT needs significantly less number of bits 
to achieve order-2 optimality than Q-SPRT With a fixed number of bits, unlike Q-SPRT our scheme can 
also attain order-1 asymptotic optimality when the average communication period tends to infinity at a 
slower rate than | loga|. 

Simulation results showed that with a finite number of bits only a discrete set of error probabilities 
are available due to the updating of the global running LLR with a finite number of possible values. 
RLT-SPRT, using 1 bit, performs significantly better than 1-bit Q-SPRT, and even better than oo-bit Q- 
SPRT at its achievable error rates. We also provided simulation results for varying SNR conditions and 
increasing SU diversity. Using 1 bit RLT-SPRT performs remarkably better than 1-bit Q-SPRT. It also 
attains the performance of oo-bit Q-SPRT at low SNR values, and even outperforms oo-bit Q-SPRT for 
SNR greater than 3 dB or when the number of SUs exceeds 3. 

Appendix A 

In this appendix we are going to prove the validity of TheoremQ] Let us first introduce a technical 
lemma. 

Lemma 3. If A,B are the thresholds of Q-SPRT with sampling period T and r quantization levels, then 
for sufficiently large r we have 

I<^; £<^, (44) 

Po Pi 

where po,pi > are the solutions of the equations Eo[e PoLT ] = 1 and Ei[e~ PlLT ] = 1 respectively. 
Furthermore we have pi > 1 — A{1 + °f (1)} where, we recall, lj = -^|E,,[Li]| and (ft is the maximum 
of the absolute LLR \l\\ of a single observation at any SU, as defined in (1101) . 

Proof: We will only show the first inequality in (l44l since the second can be show in exactly the 
same way. We recall that the two thresholds A, B are selected so that the two error probabilities are 
satisfied with equality. In particular we have a = Po(Lg > A). 

From the definition of the stopping time in ([8]) we have S = TM., that is, S is an integer multiple of 
the period T. Note now that L m T = Yln=i ^nT where X n x = Sfc=i ^nT- Since we have independence 
across time and SUs we conclude that the sequence {X h t} is i.i.d. under both hypotheses. 

Let po be the solution to the equation ip(p) = Eo[e pLT ] = Eo[e pAT ] = 1, with the second equality being 
true because Lt = Xt- It is e as y to see that i^{p) is a convex function of p, therefore it is continuous. 
For p = it is equal to 1 and as p — > oo, it tends to oo as well. If we take its derivative with respect to p 
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at p = we obtain ^'(0) = Eo[Ay]. For sufficiently large number of quantization levels A*r approximates 
At consequently ^>'(0) « Eo[Ay] = TEo[Ai] < which is negative. This implies that, at least close to 
and for positive values of p, the function ip(p) is decreasing and therefore strictly smaller than 1. Since 
we have values of p for which ip{p) is smaller than 1 and other values for which it is larger than 1, 
due to continuity, there exists po > for which fp(po) = 1. In fact this po > is also unique due to 
convexity. 

For any integer m > we have L m T = Y^n=i ^nT and, due to the definition of po and the fact that 
{A n r} is an i.i.d. sequence we conclude that { e p ° LmT } is a positive martingale which suggests that it 
is also a positive supermartingale. This allows us to apply the optional sampling theorem for positive 
submartingales which yields Eo[e PoLjMT ] < 1 for any stopping time M. which is adapted to {L m T}, as 
for instance the one in the definition of S in (fT3l). Because of this observation we can write 



a = P (L < 5 > A) = P (poAs ^ PoA) < e~ poA E [e p " L ^} = e- poA E [e poLMT ) < e~ poA , (45) 
where for the first inequality we used the Markov inequality. Solving for A yields (1441 . 

Let us now attempt to find a lower bound for po as a function of the number f of quantization levels. 
We recall that p is the solution of the equation E [e p ^ T ] = 1. Note that E [e p ^ T ] < E [e p ( LT+TK ^}, 
where e = t, consequently the positive solution p of the equation ip{p) = EQ[e p ^ LT+TK ^} = 1 constitutes 
a lower bound for p$, that is, po > /5q. The function tp(p) is convex and tp(l) > 1 which suggests p~o < 1. 
Because of this observation we can write 

1 = Eo[e po(LT+TK ' e) } < E [e poLT+TK ' e ] = (E [e poLl+K ']) T , (46) 

with the last equality being true because Lt = h + • • • + It with the {l t } being i.i.d. thus suggesting 
E [e poLT ] = (E [e poh ]) T = (E [e PoLl ]) T since L\ = l\. From (06]) we can conclude 



1 < E [e poLl+K ' e ] = E 1 [e^°- 1 ) I ' 1+Kg ] < Ej 



K ^ ~ Ll e -{p -l)K<t>+Kl + K( P + L l c (p -l)K<t>+Kt 



2K<p 2Kcp 



'l --(p a -l)K<t>+Ke , Jp -1)K^>+Ke 



2(f> 2(j) 
The second inequality comes from the convexity of the exponential function namely g ^-e~ a + g ^e a > 

e~ a ~ +a ^~ = e z , for —a < z < a. If we call x = e^ pa ~ l ^ K ^ then the inequality in (l47l) is equivalent to 
suggesting that x is either larger than the largest root or smaller than the smallest root of the corresponding 
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equation. We are interested in the first case namely 



x = e»»- W > V i + k 02 = 1 - (g£ + o(e)) , (49) 

where for the last equality we used the approximations e e = 1 + e + o(e) and \/l + e = 1 + 0.5e + o(e). 
Taking now the logarithm, solving for po and using the approximation log(l + e) = e + o(e) we end up 
with the lower bound po > 1 — + o(e). Finally recalling that po > pa proves the desired inequality. ■ 
Proof of Theorern\]} Again we will focus on the first inequality, the second can be shown similarly. 
From the definition of S in ( fT3l ) we have Ei[«S] = TEi[.M]. From the definition of M. in ( fT3T ) and Wald's 
identity we can write 



Ei [Amt] = Ei 

consequently 



M 



.771=1 



Ei [M] E a [A T ], (50) 



^ = T Ei|W] 
Ei [At] 

Next we upper bound the previous ratio. Let us start with the denominator, for which we find the 
following lower bound 

Ei [At] = Ei [At] + Ei [At - At] > TE X [Ai] - TKe = TK{\\ — e) = TiTIi (\ - ^ , (52) 
where we recall e = t. 

r 

For the numerator we have the following upper bound 

Ei[L MT ] = Ei{L mt 1 {Lmt > A} } + Ei[Z > ,Tl { T jvtT <_,3}] ^ E i[^rl { T_ MT >A } ] 

< i + EiI^t - A)1 {Lmt > A} ] =A + Ei[{L {m _ 1)t -A + \mt)1 { l mt >A}}- (53) 
The second term in the right hand side of the first equality is negative, therefore, eliminating it yields an 
upper bound. Note now that since J\AT is the first time Ljs/ix exceeds A we necessarily have -^(/k_ i)t — 
A < 0. Also A m T = ^2k=i ^rnT — KT(j> since, as we have seen, A^ T < T(f>. These two observations 
combined with Lemma[3] and used in d53l ) suggest 

Ei [L MT ] <A + KT<p< - J 1 " 8 " 1 — +KT<P=\\oga\\l + Ul + Of (1)}) + KT<t>. (54) 

1 - + o f (1)} t li J 

Applying in (|5T| ) the previous bound and the bound in d52l , yields 

r ~ n I log a\ 1 + H 1 + Of (1)} <£ 1 I logcd f e r . d> r 

< A V V f — + ^ttt = "~mr I 1 + V 1 + ° f + r il { 1 + m 

(55) 
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Finally using (l37l ) from Lemma[2l we obtain 

< E x [5] - Ei[5] < + 0f (1)} + T^{1 + o f (1)} + 0a (l), (56) 

K\f r \i 

which is the desired inequality. ■ 

Appendix B 

Before proving Theorem|2] we need to present a number of technical lemmas. 
Lemma 4. Consider the sequence {Lt n } defined in d29l ) where {t n } is the increasing sequence of time 
instants at which the FC receives information from some SU. We then have that {e L ' n } and {e~ Ltn } are 
supermartingales in n with respect to the probability measures Po and Pi respectively where the two 
measures also account for the randomizations. 

Proof: We will show the first claim namely Eo[e Lt " | L tn l , . . . , L tl ] < e Ltn - % . It is sufficient to prove 
that £ Q [e i ^- Lt —\L tn _ l ,... 1 L tx ] < 1 and, using that Eote^+W . . . ,L tl ] < 1. 

Let {(b\,qi), . . . , (b n , q n )} denote the n messages received by the fusion center until the nth commu- 
nication time t n . Denote with {k\, . . . , k n } the indices of the corresponding transmitting SUs. Then the 
n messages given these indices are independent due to the independence of observations across time and 
across SUs. Using the tower property of expectation we can then write 

E [e h ^+^\L tn _ x ,...,L tl ] 

= E [E [e b "" ( A +^ ^ | 6^ , ^« , ),..., (fci , , 9 ^ , )] l^t^, , . . . , i tl ] 

= E [E [e b >^ A+ ^\k n ,bt,qt,Qt]} = E [E [e 6 "" ^ A+ ^\k n , 6*», g*»]] 

= E [Eo[E[e 6 «™( A+ ^)|^»,^]|M] < E [E [e L ^~ Lt ^\k n }}, (57) 
with the second equality being true due to the conditional independence of the messages; the last inequality 
due to (|30l ) of Lemma[TJ and E[-] denoting expectation with respect to the randomization. Now {e L * } is 
a martingale with respect to Po, therefore it is also a supermartingale. For stopping times T\ > T2 we 
have from optional sampling for positive supermartingales Eo[e r i \k n ,L 1 n , . . . , Lj?\ < e T ? from which 
we conclude that Eo[e ir i ~~ Lt 2 \k n ] < 1. Our lemma is proven by selecting 71 = and 7i = * 
An immediate consequence of the previous lemma and the application of optional sampling for positive 
supermartingales is the following corollary. 

Corollary 1. If M > is any stopping time which depends on the process {Lf n } and since to = and 
L = 0, we conclude that 

Eo[«Av] < = 1; Ei[e- Lt *] < e~ Lt « =1. (58) 
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In particular for the case N = infjra : Lf n (—B,A)} and recalling the definition of the RLT-SPRT 
stopping time S = tj\f in (l25l) . we have 

E [e^] < 1; Ei[e~ £ *] < 1. (59) 

Let us now find useful estimates for RLT-SPRT. 
Lemma 5. If A,B are selected in RLT-SPRT to assure error probabilities a, (3 then 

i<|loga|; B<\\og(3\, (60) 

Ei [Lg] < | log a| + Kef); - E [L S ] < | log [3\ + Kcj>. (61) 



Proof: According to our usual practice we will only show the first inequality in both cases. For (|60 
note that 

-Ac r„i.«i ^ „-A 



a = P {L S > A) < e" A E [e^] < e~ A , (62) 



where we used the Markov inequality and d59l ). 
For d6T| ) we can write 

<A + K<t><\\oga\+K<j). (63) 

The first inequality in (l63l comes from the fact that the overshoot cannot exceed the last update performed 
by the FC on its test statistic {L t }. The maximum value of this update is K<p since we can have, at 
most, all SUs transmitting information to the FC and each message is upper bounded by <fi. Of course 
for the last inequality we used (l60l) . ■ 
Lemma 6. Let {t^} be the sequence of sampling times at the kth SU and denote with A/j the number 
of samples taken up to time t. Consider an i.i.d. sequence of random variables {Cn} where each Q n is 
a bounded function of the observations y^ h +1 , • • • , y\ k such that |£ n | < M < oo. Let T be a stopping 
time which at every time instant t depends on the global information from all SUs up to time t. Then we 
have the following version ofWald's identity 

E ' 



ECn 



i >E i [Ci]E i [ATf]-2M. (64) 

71=1 

Proof: The proof can be found in Ifl6l Lemma 3]. ■ 
Next we estimate the average sampling period of each SU. 
Lemma 7. Let {t^} be the sequence of sampling times at the kth SU, with A the common parameter 
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that defines the local thresholds, then 




(65) 



(66) 



Proof: As we mentioned before, the sampling process at each SU is based on a repeated SPRT with 



Using the classical Wald identity and the known lower bounds for the corresponding average delays we 
have under Hi 



where, we recall, %{x,y) = xlog^ + (1 - x) log ±^ and a k = Po(-££* - L* k _ > A), f} k = 
P x {L% n -L k tl i < -A). 



From Wald's classical estimate of the error probabilities we have a k < e _A (l — fik) and j3 k < 
e _A (l — a k ). These two inequalities generate the following two regions of points: 1) for < (3 k < i+ e ± 
we have < a k < e~ A (l — /?&); 2) for 1 ^a < (3 k < e _A we have < a k < 1 — /3fce A . Since we 
cannot compute the exact values for the two error probabilities a k ,(3k, we will find the worst possible 
pair (a>k,/3k) within the two regions that minimizes the lower bound a^). 

The function a k ) is decreasing in both its arguments, provided that ctk + ftk < 1- Therefore when 

< Pk < ppgs we can replace a k with its maximal value e _A (l — and strengthen the inequality 
in (l67l ). The resulting lower bound H(f3k, e _A (l — as a function of is decreasing and therefore 
exhibits its minimum for f} k = jjgs ■ Similarly when 1+ 1 gA < < e _A we can replace again with 
its maximal value and strengthen the inequality. The corresponding lower bound is now Hifih, 1 — /3fce A ) 
which, as a function of is increasing, therefore the minimum appears again for f} k = 1 ^a . This 
suggests that the lower bound is minimized when = which, in both cases yields an equal value 
for namely a k = 1+ 1 eA ■ Concluding, the final lower bound is ^( 1+ 1 eA , y^k ), which is equal to 
Atanh(— ). Similarly we can show the bound under Ho- 

Proving (|66l ) is straightforward since the difference L k k — L k k is simply the quantized version of 
L k k — L k k and, by design, the quantization error does not exceed e = t. ■ 
Proof of Theorem^ We need to find an upper bound for Ei[<S]. Note that using the classical Wald 
identity we can write 



thresholds —A, A, and every time this SPRT stops we sample the incremental LLR process L k — L k k 



-i 



E a -L k k J = EifLftE^-tJU] >H{/3 k ,a k ) 



(67) 



K\ 1 E 1 [S] = Ei[L^] = Ei[L^ - L S ) + E^Lg]. 



(68) 



Let us consider the term L t — L t = ^fe=i(-^t ~~ )• For tne ^ tn we nave tne sec l uence of sampling 
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times {t^}; call M k the number of samples taken up to (and including) time t. Then we can write 



N t Mi Af, fc 



(69) 



with the equality being true because L k = L% . The first term in the right hand side is the incremental 

Af ( fc 

LLR at the kth SU before the next sampling. Since this quantity lies in the interval (—A, A) it is upper 

bounded by A. Consequently we can write 

Af t k A/? 

L k t -L k t < &+^{L\-L\_-{Ll-L\j} < A+J2{b k n (A+q k n )-b k n (A+q k n )} < A + iM t \ (70) 



n=l 



n=l 



where we recall that e = t is the maximal quantization error. Replacing t with S, taking expectation on 
both sides and summing over k yields 

E 1 [^-^]<ifA + eT 1 [A(s]. (71) 

where is the total number of messages received by the FC up to the time of stopping. 
Consider now the following expectation and use (l64b from Lemma[6] and (l66l ) from Lemma|7] 



E 1 [L|] = E 1 [4 ] = E 

s 

, .'A 
> Atanh 



'Mi 



n=l 



=i 

e) E l [Afl]-2cl> 



> E 1 [L k ]E 1 [Ml] - 2cP 



Summing over k and solving for Ei [J\fs] yields 



E IAT 1 < !lM±^ 



Replacing this in (1711 ). we obtain 

Ex[L s - La] < KA + e 



Ei[Lj] + 2if</> 
^ - " ^ " r C Atanh(A) -e' 



(72) 



(73) 



(74) 



Finally using the previous inequality in (l68l) and (loTT) from Lemma|5l yields 

30e 



tfliEi[«S] < 1 + 



Atanh(A) - % 



\oga\+K \ A + 6 + 



Atanh(A) - £ 



Subtracting the lower bound (l37l) for the optimum Ei [5] we obtain the desired estimate. ■ 
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